Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Allen-Zhu, Zeyuan, Li, Yuanzhi
Large language models (LLMs) can store a vast amount of world knowledge, often extractable via question-answering (e.g., "What is Abraham Lincoln's birthday?"). However, do they answer such questions based on exposure to similar questions during training (i.e., cheating), or by genuinely learning to extract knowledge from sources like Wikipedia? In this paper, we investigate this issue using a controlled biography dataset. We find a strong correlation between the model's ability to extract knowledge and various diversity measures of the training data. $\textbf{Essentially}$, for knowledge to be reliably extracted, it must be sufficiently augmented (e.g., through paraphrasing, sentence shuffling) $\textit{during pretraining}$. Without such augmentation, knowledge may be memorized but not extractable, leading to 0% accuracy, regardless of subsequent instruction fine-tuning. To understand why this occurs, we employ (nearly) linear probing to demonstrate a strong connection between the observed correlation and how the model internally encodes knowledge -- whether it is linearly encoded in the hidden embeddings of entity names or distributed across other token embeddings in the training text. This paper provides $\textbf{several key recommendations for LLM pretraining in the industry}$: (1) rewrite the pretraining data -- using small, auxiliary models -- to provide knowledge augmentation, and (2) incorporate more instruction-finetuning data into the pretraining stage before it becomes too late.
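The linear-probing idea mentioned above can be illustrated with a minimal sketch: train a linear classifier directly on hidden embeddings and check whether an attribute is decodable. The data below is synthetic with a planted linear structure (not the paper's biography dataset), and all variable names are hypothetical.

```python
import numpy as np

# Synthetic stand-in for hidden embeddings of entity names.
# In the paper's setting these would come from a pretrained transformer;
# here we plant a linear structure so the probe has something to find.
rng = np.random.default_rng(0)
n_entities, dim = 200, 64

emb = rng.normal(size=(n_entities, dim))       # stand-in hidden embeddings
w_true = rng.normal(size=dim)                  # planted linear direction
labels = (emb @ w_true > 0).astype(int)        # binary attribute per entity

# Fit a linear probe: ridge-regularized least squares on +/-1 targets.
y = 2 * labels - 1
lam = 1e-2
w_probe = np.linalg.solve(emb.T @ emb + lam * np.eye(dim), emb.T @ y)

# High accuracy means the attribute is (nearly) linearly encoded
# in the embeddings; chance level would be ~0.5.
pred = (emb @ w_probe > 0).astype(int)
accuracy = (pred == labels).mean()
print(f"probe accuracy: {accuracy:.2f}")
```

Because the attribute here is linear by construction, the probe recovers it almost perfectly; on real embeddings, low probe accuracy would suggest the knowledge is distributed across other token positions rather than stored in the entity embedding.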
Best Stocks To Buy Based on Machine Learning: Returns up to 27.02% in 3 Days
This forecast is part of the Top 10 Stocks Package, one of I Know First's systematic trading tools.

Package Name: Stock Forecast & S&P500 Forecast
Recommended Positions: Long
Forecast Length: 3 Days (12/1/2020 – 12/4/2020)
I Know First Average: 13.95%

The algorithm correctly predicted 10 out of 10 of the suggested trades in the Stock Forecast & S&P500 Forecast Package for this 3-day forecast. The prediction with the highest return was KSS, at 27.02%. CCL and MT also performed well for this time horizon, with returns of 17.37% and 17.18%, respectively.
Can Tesla's Elon Musk revolutionize tunneling?
An image released by Tesla Motors shows a conceptual design rendering of the Hyperloop passenger transport capsule. If it were anyone else, the notion of digging hundreds of miles of tunnels to create a new subterranean transportation network under congested cities would seem like pure science fiction. But the dreamer behind this vision is Elon Musk, the billionaire innovator who has already shown with his Tesla electric cars and SpaceX rockets that he thinks big and doesn't wait for others to transform fantasy into reality. Once again, Musk is aiming to shake up an arcane industry not used to outside-the-box thinking and yet potentially ripe for disruption: the underground world of tunneling. He could use tunneling to achieve his moonshot goal of clearing up Los Angeles traffic.